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ABSTRACT 


The polarization of an X-ray beam that produces electrons with velocity components 
perpendicular to the beam generates an azimuthal distribution of the ejected electrons. 
We present methods for simulating and for analyzing the angular dependence of electron 
detections which enable us to derive simple analytical expressions for useful statistical 
properties of observable data. The derivations are verified by simulations. While we 
confirm the results of previous work on this topic, we provide an extension needed for 
analytical treatment of the full range of possible polarization amplitudes. 


Subject headings: methods: data analysis — method: statistical — polarization - 
X-rays: general 


1. INTRODUCTION 

X-ray polarimeters that are based on Compton scattering or the photoelectric effect lead to a 
distribution of events assigned to azimuthal angles in the plane perpendicular to the X-ray beam. 
The events are detected photons in the case of scattering experiments and detected electrons in 
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the experiments using the photoelectric effect. As in the case of polarization of optical and longer 
wavelengths, the polarization of the incoming radiation leads to a distribution 

= I f + Uf cos(2</>) + Q f sin (2</>), (1) 

where If, Uf, and Qf describe the distribution of electrons and should reflect the Stokes parameters 
of the incoming X-ray radiation. In general, detector characteristics and background cause the 
modulation of the signal to differ from that of the incoming beam. In this paper we address 
only the observed modulation. It would be that of the incoming beam for a perfect detector with 
negligible background. 


Then the polarization amplitude and angle are 


a e = {Uf + Q/) 1/2 /If 

(2) 

and the polarization angle estimate is 

4> e = ^arctan (Q f /Uf). 

(3) 


Recently the appropriate err ors on the determ ination of X-ray polarization wer e discussed by 


Strohmaver fe Kallman ( 2013, here after Paper l uElsner, O’Dell fc Weisskoof ( 201 2l l . I Weisskoof’. Eisner. &; O’Dell 


(2010), and Weisskopf et al (2009). The best estimates and errors on the measurements of optical 
infrared, and radio polarization have been discussed in a number of publications taking int o account 


the t e chniques used for the mea s urements and their associated experimental e r rors (Vaillan court 


20061 : ISimmons fa Steward Il985t IClarke et al 1 198.1 : IWardle fa Kronberd Il974l: ISerkowskil Il962h . 


While the polarization amplitude and angle are intuitively the focus of attention, they are not 
linearly related to the measurables and not independent. It has been pointed out that the Stokes 
Uf and Qf are more amenable to statistical treatment. The same is true for X-ray polarization. 


It has often been assumed that Uf/If and Qf/If are independent, normally distributed variables. 
These then determine the probability of measured amplitude a e and position angle 4> e , given the 
true amplitude ao and position angle 0 q: 

P(N, a e , a 0 , 4> e , 4 >q) = exp[—^-(ag + a 2 0 - 2a e a 0 cos 2{cj) e - <fo))]. (4) 

47r 4 

Strohmayer fa Kallman (2013) did extensive simulations for the range of polarization amplitudes 
0 — 1. While there was agreement for small amplitudes, it was found that the simulation results 
differed as the amplitude approached 1. 


We also carried out simulations, with a different approach, but also assuming Poisson errors on the 
number of events. We have kept the same convention for Uf an d Q r as used in Paper I, though other 
conventions are used, for example in radio astronomy (Ham aker fa Bregman 19961 ). We likewise 
see divergence from Equation [4] for the joint probability in angle and amplitude. It is the purpose of 
this paper to point out that the divergence we find has a simple description that can be analytically 
derived. In Section 2 we describe our simulations and the analytical description. In Section 3 we 
ask what the confidence regions would be for the polarization, given a measurement. Section 4 
summarizes the results. 
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2. GENERATION AND ANALYSIS OF SIMULATED DATA 

We consider the case of measurements using the photoelectric effect. The partially polarized incident 
beam of X-rays can scatter electrons into various azimuthal angles around the axis of the incident 
beam. We want to estimate the degree of polarization in the incident beam, and its direction, 
from the observed numbers of electrons scattered into different azimuthal angles, and also the 
uncertainties in those estimates. 

The polarized component of the beam scatters electrons into an azimuthal angle cf), measured from 
the direction of the incident polarization, with a probability proportional to cos 2 ((f)), while the un¬ 
polarized component of the beam scatters electrons isotropically. In general, if the true polarization 
is at an angle <f>o in a coordinate system, we would replace (f) by ((f) — <f> o). 

So we suppose that the number of scattered electrons reaching detectors, per unit time, at an 
azimuthal angle between (j) and ((f) + d(f>), can be expressed as proportional to [Iq + Uq cos(2 (f))]d(j). 

This form ranges from Iq — Uq to Iq + U$ and has an amplitude of variation ao = Uq/Iq. 
ao is one of the physically significant quantities that we would like to estimate. 

The detectors are taken to divide the interval from —7 r to +7T into M equal angular bins, each of size 
27 t/M radians. M is an odd integer large compared with unity. The bins can be labelled with an 
index j which runs from —(M— l)/2 to +(M — l)/2, with the center of the jth bin at (f>j = 2irj/M. 

The expected number of counts in the jth bin in a time T is 

( nj ) = ^[/o + tfocos(2^-)] = ^[l + a 0 cos(2^)]. (5) 

k includes geometric and efficiency factors; it is proportional to T. 

The values of rij are independent Poisson-distributed random variables, with these mean values 
(and variances). The total number of counts 

N = Y j n j ( 6 ) 

j 

is then Poisson-distributed with a mean (and variance) equal to 

(N) = kIq. (7) 

So a set of samples generated with the same ( N ) will have different total count numbers. 

The mathematical properties of the rij values enable deriving a number of useful facts about the 
statistical properties of various quantities related to them. 

To estimate the incident polarization amount and direction from an observed set of rij values, we 
seek to represent their angular distribution by a function describing the counts per unit angle. 

/(</>) = If + C//cos(20) +Qf sin( 2 4 >). 


( 8 ) 
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Then the amount and direction of the incoming beam can be estimated in the usual way. For the 
amplitude of polarization the estimate is 


= (t/J + Q?) ,/2 /// 


and the polarization angle estimate is 


4> e = -arctan (Qf/Uf). (10) 

The arctangent function in this equation is actually the two-argument arctangent function [repre¬ 
sented in many programming languages as arctan 2(Qf t , Uf)]. Technically, it is the principal value 
of the argument of the complex number Uf + iQf. Its values range from —7 r to +7T radians, so </> e 
ranges from —90° to +90°. 

Multiplying the function f(<p) by any constant produces no change in the amplitude and angle 
estimates. 

There is a useful graphical representation of the results. To simplify the notation somewhat, let U' 
stand for Uf/If and Q' stand for Qf/If. Then the amplitude and angle estimates are just given by 

a e = VU' 2 + Q' 2 (11) 

and 

4> e = — arctan(Q' /U r ). (12) 

If we think of U' and Q' as the rectangular coordinates of a point, these are the polar coordinates 
of that point: a e is the distance of the point from the origin and 2 <j> e is the angle between the line 
from the origin to the point and the U' axis. It will be seen that it can be convenient to discuss the 
geometry of U' and Q 1 in the U' — Q' plane and then, if desired, represent the results in plots of (j) e 
and a e . 

Choosing values of If, Uf, and Qj that best represent the angular distribution of the set of re¬ 
values could be done in several ways. If the amount of data is large enough, the results should be 
essentially the same. One approach would be to find the values that make the function / the best 
fit, in some sense, to the points. This was used in Paper 1, using chi-squared fitting. 

Another approach, which is used here, is to view / as part of a discrete trigonometric interpolating 
polynomial, where the terms in the polynomial series are the ones that reflect the properties of the 
distribution that are of physical interest. We choose to express the angular distribution as 

= 7^ n j ( 13 ) 

Then If, Uf, and Qf are the three coefficients that reflect the polarization of the incoming radiation 
and that enable estimates of its direction and amplitude. 


The coefficients If , Uf , and Qf are sums over the angles cj>j: 


( 14 ) 
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U t = 


7 r 


Q f = 


1 

7r 


Wj COs(2(/>j) 
j 

yy 7ij sin(2(/)j) 


j 

The coefficients If, Uf, and Qf are random variables, the sums of linear combinations of the 
random rij values. From the Lyapunov central limit theorem their distributions become normal as 
M becomes large enough, with means and variances which can be calculated using the sums over 
j of polynomials in cos(2 (f)j) and sin(2 M does not appear in the limit. Our simulations were 
insensitive to M above 15. 


The mean value of If is 


(If) = = “VI 2 *) = <W>/(2tt) 

3 


(15) 


The other mean values are 


The variance of If is 


(U f ) = kUq/ (27t) = (N)a 0 /(2ir) 

(Qf) = o 


Var ( I f) = ^2 S Var ( n j) = ^2 Z)< n j> = ( N )/( 4yr2 ) 

j j 

and by similar calculations the variances of Uf and Qf are each equal to 


(16) 


(17) 


= «/ 0 /( 2tt 2 ) = (N)/( 2tt 2 ). (18) 

The covariance of Qf with either If or Uf is zero, but the covariance of Uf with If is ( N)ao/4ir 2 
and the correlation of Uf and If is ao/y/2. Examples of correlation of Uf with If and Qf with If 
were shown in Paper 1. For cf >q 7 ^ 0, the correlation of Qf with If is also nonzero. 


The coefficients If, Uf, and Qf are not statistically independent, since they are calculated from the 
same set of rij values. In fact If, Uf, and Qf are trivariate normal. This is not directly useful since 
the two quantities of interest are the ratios Uf/If and Qf/If. Correct distributions for U' and Q' 
and thus for a e and (j) e can be generated by simulations in which the M values of nj are randomly 
generated for each sample, and the resulting values of If, Uf, Qf and thus U' and Q', can be used 
to get a e and <f> e . 


The points for a collection of samples will be centered around the U', Q' point with coordinates ao, 
0. Let us call that point Z. The probability density for points, that is, the probability per unit 
area in this plane, describes what fraction of the points from a set of many samples will have those 
locations. A line of constant probability, a closed curve enclosing the point Z , identifies the region 
within which a specific fraction of samples will lie, and thus the likelihood of that set of amplitude 
and angle estimates. Figure 1 shows an example for a low true amplitude. 
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Simplifications result if the variables U' and Q' are treated as if they were statistically independent 
random variables with known means and variances. In fact the available information allows a 
calculation of the marginal distribution of U' and Q' which shows that for large (N) they are 
independent. Details are given in Appendix [A] which also provides the joint distribution of U' and 
Q' before any approximation of large (N). 

The calculation also provides the means and variances of U' and Q'\ 

(U')=a 0 - <00=0; 

Var(U') = ^r(l - ajj/2) ; Var(Q') = ^y. 

These results are confirmed by the results of simulations for the full range 
0 to 1. 

The probability density for a system point would be the product of the probability density for JJ' 
and the probability density for Q' . But these are simply Gaussians with the derived variances and 
means. Figure 2 shows the results of simulations for a true polarization oo = 3/4, along with the 
Gaussian distributions with the calculated means and variances. 


(19) 

( 20 ) 

of amplitude values from 


With the definition of A as the ratio of the standard deviations: 


n2 _ a Q' _ 
" r 2 


a U' 1 a 0 


( 21 ) 


and denoting an / simply as a', the probability per unit area in the U'Q 1 plane is also a Gaussian 
(see Equation (jA.9[) ): 


A 


Pu'Q'iU ', Qf) = 7^2 exp[-(C/' - o 0 ) 2 A 2 /2u / 2 ] • exp(-Q' 2 /2u /2 ) 


( 22 ) 


A D 2 

ex P(-TT72)> 


with 


27rcr' 2 2a ' 2 ' 

D 2 = (U 1 -a 0 ) 2 X 2 + Q' 2 . 


(23) 


It is worth noting that while these results were derived with the point Z on the U' axis (i.e., with 
4>o = 0) they can be expressed in a way that is more general. Let us define 2 rj as the angular 
difference between the location of Z and the location of a data point. When Z is on the U' axis, 
r ] is just (j) e . But in general U' in the above equations is a e cos(2r/) while Q' is a e sin(2?y). Then 
Equation (1231) becomes 


D 2 


[q e cos(2r/) - qq] 2 

1 - Oq/2 


+ a 2 sin 2 ( 277 ). 


(24) 


In this form it involves only the distances of Z and the data point from the origin and the angle 
between them, and is correct for any choice of a reference direction. 
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Lines of constant probability in the U' — Q' plane are ellipses, centered on the point Z, with minor 
axes along the line from the origin to the point Z, and the ratio of major to minor axes equal to A. 
An example is shown in Figure 3. D is the semi major axis of the ellipse. The area within an ellipse 
with a given D is ttD 2 /X. The total probability of a point lying outside the ellipse with a given D 
is just exp(— D 2 /2a' 2 ) (as long as D 2 (( 1.) See Figure 4 for an example of agreement between the 
probability distribution and a simulation. 

This provides an extension of previous work, covering the full range of incident amplitudes, even 
when the correlation of If with Uf changes the variance of U' enough to be important. The results 
for U' and Q' also give 


Var(a e ) 

Var(ry) 


a' 2 (l-a§/2), 

1 / 

4 a o " 


(25) 

(26) 


Since A is quite close to unity unless the incident polarization amplitude is quite large, however, a 
simpler approximation should often be applicable. Setting A = 1 reduces D 2 to 

R 2 = {U' - a 0 ) 2 + Q' 2 (27) 

This is simply the square of the distance from the point with rectangular coordinates U' and Q' to 
the point Z. The probability density is 

Pu'Q'(U\ Q') = —^ exp(—i? 2 /2cr /2 ). (28) 

The lines of constant probability are simply circles centered at Z. The probability that a point is 
farther from Z than R is exp(— R 2 /2a' 2 ). 


It is straightforward to reexpress this in terms of the polar coordinates a e and 2 cf> e , noting that the 
element of area in these coordinates is a e ■ d(a e ) • d{2(j) e ) . Also, the variance a' 2 = 2/(N) can be 
replaced by 2/N if the number of counts is large. The result is 


P(a e , 4> e ) = exp[-^(a 2 + a 2 0 - 2 a e a 0 cos(20 e ))] 

47T 4 


(29) 


which is the formula common ly used in discussions of this topic (jWeisskopf. Eisner. &; O’Dell 


2010 


Strohmaver fc Kallmanl 2013), the same as Equation(j4|) with = 0. 


As pointed out by those authors, if the angle is not of interest, but only the amplitude of the 
polarization, the integral over angles gives the Rice distribution ({Rice 19_45) for the amplitude, a 
distribution used in various other signal processing applications. If, however, ao is not small and 
Equation (1241) is app licable, the angular d i stribution is in the category of generalizations of the von 
Mises distribution (jMardia &; Jupp 1 19991 : lYfontis fe Borgman Il982l ) . It has been used in studies 
of Brownian motion, waves, and bending of biological molecules, for examples. The integral over 
angle gives a more complex distribution in a e . This can certainly be numerically computed, if a 




















single parameter confidence region is desired. In this paper we are concentrating on joint confidence 
regions in the polarization amplitude and direction. 


Comparison to the work of S trohm aver &; K allma n (2013j, Paper 1) finds some differences in the 
variances of Uf and Qf and their correlations with If for the variables calculated in different 
ways. For a given simulation the results of calculating Uf and Qf from trigonometric interpolating 
polynomials agree with the values obtained from unweighted least squares fitting (in accordance 
with Parseval’s theorem), but differ from the results of least squares weighted inversely with the 
counts in the bins (which exceeded 100 for all bins in our simulations). For the unweighted least 
squares, the variances of Uf and Qf are independent of polarization and the correlations strictly 
linear, while for the weighted least squares, the variances decrease with amplitude. This in turn 
causes an amplitude dependent increase in the correlation magnitudes. It does not seem surprising 
that there is a difference in the distribution of the best fits and the shapes of the probability surfaces. 
The mean values of the fits are consistent. 


Some details of our simulations and calculations are given in Appendix [Bj 


3. DRAWING CONCLUSIONS FROM AN OBSERVED DATASET 

We turn now to the question of how we can draw conclusions about the magnitude and direction 
of incoming X-rays from a single data set. We have the number of counts at each azimuthal angle 
with respect to some angle on the sky. We calculate If, Uf, and Qf for our angular distribution 
of counts, and from them we compute two quantities: U' and Q' . We can think of these as the 
rectangular components of a point in a coordinate system with the U' axis in our reference direction. 
The origin of this coordinate system is the point corresponding to a completely unpolarized beam, 
for which both components would vanish. 

What we do not know but would like to draw conclusions about is the location of the point Z in 
this coordinate system. By definition, Z is the point whose distance from the origin is the true 
polarization amplitude ao and which is in the same direction as the true polarization direction. 

A contour line enclosing the observed data point can be defined by the fact that the probability 
of the observed U' and Q' values for any Zc (the C stands for “candidate”) located at a point on 
that contour has the same value. Any point inside the contour is a location for Zc that gives a 
probability of the data that is higher, and any point outside the contour is a location for Zc such 
that the data has a lower probability. This defines a confidence region for Zc- 

We want to find the coordinates of the points Zc for which the probability of the data point, with 
coordinates U' and Q' , has a specified probability. The geometry of this situation is very similar to 
the case previously studied, where one wanted to find which data points have a specified probability 
when the true amplitude is known. We define the angle 2 rjc as the angle between lines from the 
origin to the data point and to Zc, the difference between the angular position of the data point 
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and the angular position of the point Zq- 


In order to have a specified probability, the data point must lie somewhere on an ellipse whose 
center is at the point Zq, whose major axis is perpendicular to the line from the origin to Zq and 
has the length 2D, and whose minor axis is 2D/Xc, with A c = 1 /\Jl — 4/2. D is related to the 
probability in the same way as in the previous discussions; it depends only on the total number of 
counts in the data set and the specified probability. 


From the earlier section, the requirement for the desired probability is given by Equation (|24J) . which 
can be rewritten for the present purpose as 


D 2 


[a e cos(2 rjc) ~ ac ] 2 

1 - 4/2 


+ a 2 e sin 2 (2 rj c ). 


(30) 


We define a pair of new variables 

u = ac cos(2r]c) ; v = ac s\n{2iqc) (31) 

which specify the location of Zq relative to the data point. Using these, the condition that the data 
point have the correct position relative to Zc can be written as 

u 2 (l + D 2 / 2) - 2 a e u + v 2 [l + [D 2 - a 2 )/2] = D 2 - a 2 (32) 

This is a quadratic in u and v and thus defines an ellipse. It is even in v, so the axes lie along 

and perpendicular to the line from the origin to the data point. The quadratic equation for u 
when v = 0 has two roots equidistant from u = a e /( 1 + D 2 / 2), so the center of the ellipse is 

located at u = a e /( 1 + D 2 / 2) ; v = 0. The v semi-axis is D/yj 1 + D 2 /2 and the u semi-axis 

is [D/( 1 + D 2 1‘2^\yj\^rjy 2 l2 — c^j2. (A more thorough and detailed derivation is provided in 

Appendix O) 

Since the derivation of probabilities in the earlier section included neglecting D 2 compared with 
unity, it is not inconsistent to do the same here. The result is simpler than might have been 
expected. The ellipse is centered on the data point and has axes of 2D and 2D/\ e . The data point 
and the candidate have changed places. Contour lines around the data point (in the U'Q' plane) 
with a specified value of D are given by: 

D 2 = (it — a e ) 2 A 2 + v 2 (33) 

where u = ac cos(2r/) and v = ac sin(2r/) as defined in Equation (31), and 2r/ is the angle between 
the data point and the point on the contour. 

One can then derive a description of probabilities similar to the situation of a known incident 
amplitude. The area of an ellipse with a given value of D would be just nD 2 /A, and the incremental 
area between ellipses for D and D + dD would be proportional to DdD. An integral from D to 
infinity is then proportional to exp(— D 2 /2a' 2 ). Normalization makes the probability outside the 
contour equal to exp(— D 2 /2a' 2 ). 
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Figure 5 shows the implication of the results for a particular estimated polarization from a mea¬ 
surement sample. The marginal probability of U' and Q' derived in Section 1 is in terms of (N) for 
many samples. For a single sample, one has only the N for that sample. The probability depends 
on ( N)D 2 and is only significant for small D 2 . The fractional error in D 2 should be 1 /y/N. 

It seems likely that in any practical situation there will be large enough additional uncertainties 
in data acquisition not included in this statistical analysis that the difference in the contours for 
approximate or full inclusion of D 2 dependence would not be important. 

It has been mentioned that statistical variations in the recorded counts result in a positive probability 
for any pair of U' and Q' values, even improbable ones which might lead to amplitude estimates that 
are larger than 1, while the true amplitude can only be zero or positive and less than or equal to 1. 
This could complicate the correct conclusions that can be drawn from a single data set if the total 
number of counts is too low or the confidence level chosen leads to a formal contour that extends 
into regions where U' 2 + Q' 2 is greater than 1. The simplest way to avoid such problems is just 
to get more data, or choose a different confidence level, or both, so that the contour is completely 
within the acceptable range. But if this is not feasible or desirable, one could impose restrictions 
and provide a truncated confidence region. 


4. SUMMARY 

The methods presented here for generating simulated counts of scattered electrons at different 
azimuthal angles, and for analyzing the resulting angular distribution, have the special advantage 
of making it possible to derive rigorous results about the estimation of polarization amplitudes and 
directions and their uncertainties. Alternative methods for generating simulations, and for analyzing 
the angular distribution, should give results that are essentially identical to these, but some of the 
predicted properties may only be observed rather than derived. 

We confirm previous work on amplitude and angle estimates for cases in which the incident amplitude 
is not too large. With our approach we are able to provide analytical treatment for larger incident 
amplitudes, so that the entire physical range of possibilities is covered. 


We thank Tod Strohmayer, Tim Kallman, and Phil Kaaret for detailed discussions. 
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A. THE DISTRIBUTIONS OF U fl Q f AND OF U',Q' 

It is useful to introduce another variable, linear in Uf and If. Because (U /) = ao (If), the mean 
value of (Uf — a^If) is zero and because the covariance of If with U f is just ao times the variance 
of I f , the covariance of (Uf — (Milf) with If is zero. The variance of (Uf — a^If) is aj (1 — Oq/ 2). 
We introduce the quantity 

A = 1/^1-a 2 /2 (A.l) 

and define the new variable 

V f = A (U f - a 0 //). (A.2) 

Vf has a mean value of zero and the same variance as Qf and is uncorrelated with I / and Q /. 

The three variables, If , Vf, and Qf are trivariate normal, and mutually uncorrelated, so they are 
independent. Then the joint probability distribution function is given by 

Pi,v,Q(If , Vf, Qf) = (9 J^ 3/2 exp [-^2 (2 (If - (If)) 2 + Vf + Q 2 )]- (A.3) 


What we would like to have is the joint probability of Q' = Qf/If and V' = Vf/If = \(U' — ao). To 
obtain this, we transform to V' and Q' as our variables, and integrate the probability of If, V', Q' 
over If. For the transformation we have dV'dQ' = I 2 dVfdQf. Since V' and Q' only appear in the 
combination 

D 2 = V' 2 + Q' 2 (A.4) 

and (If) = (N)/(2n) = tto/, the integral to be evaluated is 

/o r oo J 2 

Pv'Q'(V, Q') = V / dlfl 2 exp[ 4(1 + D 2 /2) + 2nl f - ttV/ 2 ], (A.5) 

[Ziraf) t J—oo ®f 

which can be evaluated exactly. It is convenient to introduce 


/2 _ 1 _ 2 
7T 2 CJ 2 (N) 


(A.6) 


. Then 


Pv'Q'(V , Q') 


1 + (1 + D 2 /2)/(N) 1 D 2 

(l + D 2 / 2) 5 /2 2k a' 2 6XP 2cr' 2 


(A.7) 


In any practical application, the value of (N) will be so large that the term proportional to 1 /(N) 
can be neglected in comparison with unity. Moreover, the values of D that might be of interest will 
be small enough to justify replacing (1 + D 2 /2) with unity. Otherwise, the probabilities involved 
are so small that they would be of no value. Accordingly the initial fraction in Eauation (IA.7l) can 
be dropped, leaving 

P,v(V",0') = ^Uex P (-A). 

The distribution of D 2 is just a chi-squared distribution with two degrees of freedom. 


(A.8) 
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Thus it is clear that 

Pu'Q'(U',Q') = ^ 72 ex P(-^72)- ( A -9) 

This is the form in which U' and Q' are approximately independent normal variables with the 
variance in U' lower than that of Q' by 1 — Oq/ 2. These results for the variance of U' and Q' agree 
with the estimates given by considering ( 5 2 U') and ( 5 2 Q '), where expanding around the mean values 
gives 5U' = 6Uf/(If) — (Uf)5If/(If) 2 and similarly for 5Q r . 

If cf >o f 0, the following means and covariances have a 4 >o dependence: 

(Uf) = ^p-a 0 cos(2(j) 0 ), (A.10) 

Ztt 

(N) 

(Qf) = -^aosin(20 o ), 

(N) 

(5Uf5If) = -j-^-a 0 cos(2(/> 0 ), 

(N) 

(5Q f 5I f ) = ^ja o sin(20 o ). 

It remains true that 

(SU f 5Q f )= 0. 

With 

X = C/cos(2()>o) + Q sin(2()>o), (A.11) 

Y = —U sin(2()>o) + Qcos(20 o ), 

Z = \(X-aoI f ), 

corresponding to Equation (lA.3j) . we have 

P, zr (If. Zj.Yf) = (7^§57 2 ex P[-i( 2 < 7 / - < 7 /» 2 + Z 'f + Y f "> 1- < A ' 12 ) 

X, Y are just a set of axes rotated from U, Q by 2 (f>Q. Since 

U = a e cos(20 e ) 

Q = a e sin(2<)> e ), 

we have 


X = a e cos(2(<^> e - 4> 0 )) 
Y = a e sin(2(^> e - 0 O )). 


The probability only depends on the angle between a e and the true polarization a q, which is 
277 = 2 (( j) e - ( j ) 0). 
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With X' = X/I ', Y' = y//', and D 2 = A 2 (A' — ao) 2 + W 2 the general result is derived as before, 
and similarly, for large ( N ), 

P x ,y, { X'X)^Y_ a p(-^)- (A.13) 


B. COMPUTATIONAL REMARKS 


Numerical implementation of the methods presented here for generating and analyzing simulations 
was done primarily with a simple Fortran program (available from the authors,) which can produce 
results for thousands of samples with thousands of counts each, in a few seconds on modest hardware. 


Other simulations were done using IDL with version 8.2 RANDOMU to generate Poisson variates, 


Strohmayer & Kallman 

(1 

1013, 

Paper 1). To generate the rij in this case the method of Marsaglia 

was used ( 

Zelin Sz Severo 

196' 

1). These simulations were carried out to establish that the difference 


between the U and Q variances was due to the difference between the unweighted and weighted 
leas t-sq uares fits^ The fits were carried out using IDL and the fitting routines based on MINPACK-1 
( Markwardt 2009 1. 


The locus of points a e and p that satisfy Equation (l24l) can be obtained parametrically. Defining p 
by 

a e = a 0 + p (B.l) 

and /3 as the angle between p and ao, a e cos(2y) = ao + pcos(/3 ) and a e sin(2y) = psin(/3). Then 
Equation ([Ml) gives p = D /y 7 sin(/3 2 ) + cos(/3 2 )/(l — ag/2) for 0 < /3 < 2n. We then calculate 

°o + P 2 + 2 aoPcos(/3), (B.2) 

(a 0 + pcos(/3)) 

CLe 

psin(/3) 
d e 

arctan 2(cos(2x), sin(2y)). 


cos(2\) = 

sin(2x) = 
2x = 


A similar parametric construction gives the contours of candidate true values of amplitude and 
angle, given measured values, as discussed in Section 3. 
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C. DERIVATION OF CANDIDATE CONTOURS 


The probability of a particular measured polarization, given a known true polarization, depends 
only on D 2 (Equation (|24l) in the text), where 


D 2 


[a e cos(2r?) - q 0 p 
1 — a o/2 


+ a 2 sin 2 (27/). 


(C.l) 


We wish to determine the candidate true polarizations ac at angle 2i] from a measured polarization 
amplitude and direction that would have a particular probability. The values must obey the same 
equation with ao —>• ac- The candidate point Zq has the projections u = ac cos 2tj and v = ac sin 2?/ 
parallel and perpendicular to a e . Then, using these substitutions and a 2 , = u 2 + v 2 , Equation (13211 
in the text is obtained, which can be written as 


D 2 — a 2 — (u 2 + v 2 ){l + D 2 12) + a 2 7- + 2 a e u = 0. 

The following sequence of reorganizations: 

(D 2 ~ Qe) _ 2 _ 2 , a W /‘ 1 , n 

1 + D 2 /2 l + D 2 /2 l + D 2 /2 ’ 


(C.2) 


(C.3) 


(■D 2 ~ «e 2 ) 
1 + D 2 /2 


D 2 


- («- 


( 1 - 


i ) 2 + 


1 + D 2 /2 (1 + D 2 /2) 2 

1 , CLp. 


-v 2 (l - 


all 2 


1 + D 2 /2 


= 0, 


1 + D 2 /2 l + D 2 /2 y 1 + D’ 2 /2 


- [u - 


1 + D 2 /2 


) 2 -u 2 (l- 


a ^/2 


1 -YD 2 12 


= 0, 


« 2 


leads to 


1 + D 2 /2 K 1 + D 2 /2 

D 2 




ain 


= (u- 


1 + D 2 /2 v l + D 2 /2 

all 2 


= 0. 


:) 7 ( 1 - 


-) + v 2 . 


(C.4) 

(C.5) 

(C.6) 

(C.7) 


l + D 2 /2 v 1 + D 2 /2 7 v 1 + D 2 /2 2 

This is the ellipse centered on a e /(l + D 2 / 2),0 with semi-axes (D/( 1 + D 2 /2))^/Y^al 2 j2^+ly 2 j2 
and DI\J 1 + D 2 /2 for u and v, respectively. Now neglecting D 2 /2 compared to 1, 


2= (U-Oel 2 

l-«72 + 


(C.8) 
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Fig. 1.— (top) U' and Q' for 4 sets of 100 simulations each for < N >= 1000, with the true 
polarization along the Uf axis. Points from the different sets are indicated by triangles of different 
orientation. The 68.3 % probability contour is centered on the point Z at ao = 0.1,0. (bottom) 
The polarization amplitude a e and angle (f> e corresponding to the U' and Q' results. (f> e is half the 
angle between the U' axis and the direction to the point U',Q'. 
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Fig. 2.— (top) Marginal distributions of Uf and Qf for 20,000 simulations with < N >= 4000 for 
a polarization a q = 3/4 along the Uf axis. The theoretical distributions are the same, (bottom) 
Marginal distributions of U' and Q' for 20,000 simulations with < N >= 8000 for a polarization 
do = 3/4 along the Uf axis. Here the ordinate is the number of simulations in a bin. The curves 
are the predictions for independent normal distributions. 
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Fig. 3.— (top)Distribution of JJ' and Q' for 20,000 simulations (the same as for Fig 2, bottom) 
with < N >= 8000, ao = 3/4, and </>o = 0. Contours of the predicted 1, 2, and 3 sigma levels 
(68.27, 95.45, and 99.73 %) are superposed. 13597, 19010, and 19935 simulations fell within those 
contours, in comparison to 13654, 19090, and 19946 expected. The semi-major axes are along the 
Q' axis and the semi-minor axes along the U' axis, centered on ao,0. (bottom) The corresponding 
a e and </> e with their theoretical contours. 






Fig. 4.— Distribution in D 2 for a simulation with ao = 3/4 and < N >= 4000, for 
Nsample = 20,000. The number of samples in a D 2 increment AD 2 = 10 , is plotted, nor¬ 
malized by N sample AD 2 {= 2), together with the expected values. For this case l/2cr /2 = 1000. The 
probability for the measured polarization to lie outside of D 2 matches the predicted exponential. 
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Fig. 5.— Examples of contours for the candidate true polarization quantities for a measurement of 
a e = 0.40 and (f) e = 20 degrees, (top) Contour for 68.3 % (theoretical) confidence in U'Q' space 
for the true polarization, for D = 0.05. The dashed line indicates a candidate true polarization at 
distance ac from the origin and angle 2r] from the measured polarization, (bottom) Contours (solid 
and dashed lines, respectively) of 1 sigma (68.3 %) and 3 sigma (99.7 %) for the true amplitude and 
angle, for N = 9500 events. 





